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PCT09 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/763,712 



DATE: 03/07/2001 
TIMK: 13:13:29 



3 
5 
7 
9 
9 
9 
10 
12 
14 
15 
16 
17 
19 
20 
21 
23 
24 
25 
26 
.27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
3B 
39 
40 
41 
42 
4 3 
44 
45 
46 

4 7 
48 
49 
50 
51 
52 

5 3 
54 



Input Set ; A:\37157SEQ.txt 

Output Sf^t: N:.\CRF3\0307200i\I763712 .raw 

<110> APPLICANT: Wakainiya, N. 

<120> TITLE OE" INVENTION: Novel Collectin 

<130> FILE REFERENCE: 19036/37157 

<14 0> CURRENT APPLICATION NUMBER: US/09/76 3,712 
<141> CURRENT FILING DATE 2001-02 -26 

<150> PRIOR APPLICATION NUMBER: JP HEl .10-237611 
<151> PRIOR FILING DATE: 199B-08-24 
<160> NUMBER OF SEQ ID NOS : 32 
<210> SEQ ID NO; 1 
<211> LENGTH: 2024 
<212> TYPE: DNA 
<2:i3> ORGANISM: Homo Sapiens 
<220> FEATURE: 
<221> NAl-IE/KEl': 
<22 2> LOCATION: 
<4 00> SEQUENCE: 



ENTERED 



CDS 

(670) . . (1695) 
1 



qbcacgaatc tgcagcaaga taccagcgtg ctccagggca atctgcagaa ccaaatgtat 
tctcataatg tggtcatcat gaacctcaac aacct:gaacc tgacccaggt gcagcagagg 
aacctcatca cgaatctgca gcggtctgtg yatgacacaa gccaggctat: ccagcgaatc 
aagaacgact tUcaaaatct gcagcaggtt tttcttcaag ccaagaagga cacggattgg 
ctgaaggaga aagtgcagag cttgcagacg ctggctgcca acaactctgc gttggccaaa 
gccaacaacg acaccctgga ggatatgaac agccagctca actcattcac aggtcagatg 
gagaacatoa ccactatctc tcaagccaac gagcagaacc tgaaagacct gcaggactta 
cacaaagatg cagagaatag aacagccatc aagttcaacc aactggagga acgctitccag 
ctctttgaga cggatattgt gaacatcatt agcaatatca gttacacagc ccaccacctg 
cggacgctga ccagcaatct aaatgaagtc aggaccactt gcacagatac ccttaccaaa 
cacacagatg atctgacctc cttgaataat accctggcca acatccgttt ggattctgtt 
tctctcagq abg caa caa gat ttg atg agg teg agg tta gac act gaa gta 
Met Gin Gin Asp Leu Met Arg Ser Arg Leu Asp Thr Glu Val 
15 10 . 

gcc aac tta t:ca gtg att atg gaa gaa atg aag eta gta gac tec aag 
Ala Asn Leu Ser Val He Met Glu Glu Met Lys Leu Val Asp Ser Lys 
15 20 25 30 

cat ggt cag etc ate aag aat ttt aea ata eta caa ggt cea ccg ggc 
His Gly Gin Leu He Lys Asn Phe Thr He Leu Gin Gly Pro Pro Gly 

35 40 45 

ccc fsgg ggt cea aga ggU gac aga gga tec cag gga ccc cet ggc cca 
Pro Arg Gly Pro Arg Gly Asp Arg Gly Ser Gin Gly Pro Pro Gly Pro 

50 55 60 

act ggc aac aag gga cag aaa gga gag aag ggg gag cct gga cca cct 
Thr Gly Asn Lys Gly Gin Lys Gly Glu Lys Gly Glu Pro Gly Pro Pro 

65 70 75 

ggc cct gcg ggt gag aga ggc cca att gga cea get ggt ccc ccc gga 
Gly Pro Ala Gly Glu Arg Gly Pro He Gly Pro Ala Gly Pro Pro Gly 

80 85 90 

gag cgt ggc ggc aaa gga tct aaa ggc tec cag ggc ccc aaa ggc tec 
Glu Arg Gly Gly Lys Gly Ser Lys Gly Ser Gin Gly Pro Lys Gly Ser 



60 
120 
180, 
240 
300 
360 
4 20 
480 
540 
600 
66,0 
711 



759 



0O7 



855 



903 



951 



999 
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RAW SEQUENCE LISTING DA'Hi: 03/07/2001 

PATENT APPLICATION: US/09/76 3,712 TIMfc:: 13:13:29 



Input Set : A:\37157SEQ. txt 

Output Set: N:\CRF3\03072001\l763712.raw 



55 


95 










10 0 










XU -) 










110 




56 


cgt 


ggt 


tec 


cct 


ggg 


aag 


ccc 


ggc 


cct 


cag 


ggc 


ccc 


a g t 


ggg 


gac 


cca 


104 7 


57 


Arg 


Gly 


Ser 


Pro 


Gly 


Lys 


Pro 


Gly 


Pro 


Gin 


Gly 


Pro 


Ser 


Gly 


Asp 


Pro 




5 8 










115 










120 










125 






59 




ccc 


ccg 


gg*^ 


cca 


cca 


ggc 


aaa 


gag 


yga 


etc 


ccc 


ggc 


cct 


cag 


ggc 


10 95 


60 


Gly 


Pro 


Pro 


Gly 


Pro 


Pro 


Gly 


Lys 


Glu 


Gly 


lieu 


Pro 


Gly 


Pro 


Gin 


Gly 




61 








13 0 










13 5 










140 








62 


cct 


cct 


ggc 


ttc 


cag 


gga 


ctt 


cag 


ggc 


acc 


gtt 


ggg 


gag 


cct 


ggg 


gtg 


1 1 yl 

.L r 4 J 


63 


Pro 


Pro 


Gly 


Phe 


Gin 


Gly 


Leu 


Gin 


Gly 


Thr 


Val 


Gly 


G 1 u 


Pro 


ury 


V a .1 




64 






14 5 










150 










XDD 










65 


cct 


t 

gqa 


cct 


egg 


gga 


ctg 


cca 


ggc 


ttg 


cct 


ggg 


gta 


cca 


ggc 


a tg 


cca 


±r y J. 


66 


Pro 


Gly 


Pro 


Arg 


Gly 


Le u 


Pro 


G].y 


Leu 


Pro 


G ly 


Va 1 


P t'O 


Gly 


Met 


Pro 




67 




160 










165 










170 












68 


ggc 


ccc 


aag 


ggc 


ccc 


ccc 


ggc 


cct 


cet 


ggc 


cca 


tea 


gga 


gcg 


gtg 


gtg 


12 39 


69 


Gly 


Pro 


Lys 


Gly 


Pro 


Pro 


Gly 


Pro 


Pro 


Gly 


Pro 


Ser 


Gly 


Ala 


Va 1 


Val 




70 


175 










180 










18 5 










ion 
J, y u 




71 


ccc 


ctg 


gcc 


ctg 


cag 


aat 


g^ig 


cca 


ace 


ccg 


gca 


ccg 


gag 


gac 


aat 


ggc 


T T 13 "7 
± z 0 / 


72 


Pl'O 


Leu 


Ala 


Leu 


Gin 


Asn 


Glu 


Pro 


Thr 


Pro 


Ala 


Pro 


Glu 


Asp 


Asn 


Gly 




73 










19 5 










200 










2 05 






74 


tgc 


ccg 


cct 


cac 


tgg 


aag 


aac 


ttc 


aca 


gac 


aaa 


tgc 


tac 


tat 


ttt 


tea 


13 3 5 


75 


Cys 


Pro 


Pro 


His 


Trp 


Lys 


Asn 


Phe 


Thr 


Asp 


Lys 


Cys 


Tyr 


Ty r 


Phe 


Ser 




76 








210 










215 










220 








77 


gtt 


gag 


aaa 


gaa 


a tt 


ttt 


gag 


gat 


gca 


aag 


ctt 


ttc 


tgt 


gaa 


gac 


aag 


13 8 3 


78 


Val 


Glu 


Lys 


Glu 


lie 


Phe 


Glu 


Asp 


Ala 


Lys 


Leu 


Phe 


Cys 


Glu 


Asp 


Lys 




79. 






225 










230 










235 










80 


tct 


tea 


cat 


ctt 


gtt 


ttc 


ata 


aac 


act 


aga 


gag 


gaa 


cag 


caa 


tgg 


a ta 


1, 4 3 1 


Bl 


Ser 


Set: 


H Is 


Leu 


Val 


Pile 


I le 


Asn 


Thr 


Arg 


Glu 


Glu 


Gin 


Gin 


Trp 


I le 




82 




240 










24 5 










250 












83 


aaa 


aaa 


cag 


atg 


gta 


ggg 


aga 


gag 


age 


cac 


tgg 


ate 


ggc 


etc 


aca 


gac 


14 79 


84 


Lys 


Lys 


Gin 


Met 


Val 


Gly 


Arg 


Glu 


Ser 


His 


Trp 


He 


Gly 


Leu 


Thr 


Asp 




85 


255 










260 










265 










270 




86 


tea 


gag 


cgt 


qaa 


aat 


gaa 


tgg 


aag 


tgg 


ctg 


gat 


ggg 


aca 


tct 


cca 


gac 


15 27 


87 


Ser 


Glu 


Arg 


Glu 


Asn 


Glu 


Trp 


Lys 


Trp 


Leu 


Asp 


Gly 


Thr 


Ser 


Pro 


Asp 




8H 










275 










280 










285 






89 


tac 


aaa 


aat 


tgg 


aaa 


get 


gga 


cag 


ccg 


gat 


aac 


tgg 


ggt 


eat 


ggc 


cat 


1575 


90 


Tyr 


Lys 


Asn 


Trp 


Lys 


Ala 


Gly 


Gin 


Pro 


Asp 


Asn 


Trp 


Gly 


H Is 


Gly 


His 




91 








290 










295 










300 








92 


ggg 


oca 


qga 


gaa 


gac 


tgt 


get 


ggg 


ttg 


att 


tat 


get 


ggg 


cag 


tgg 


aac 


1623 


93 


Gly 


Pro 


Gly 


Glu 


Asp 


Cys 


Ala 


Gly 


Leu 


lie 


Tyr 


Ala 


Gly 


Gin 


Trp 


Asn 




94 






305 










310 










315 










95 


gat 


ttc 


caa 


tgt 


gaa 


gac 


gte 


aat 


aac 


ttc 


att 


tgc 


gaa 


aaa 


gac 


agg 


1671 


96 


Asp 


Pile 


Gin 


Cys 


Glu 


Asp 


Val 


Asn 


Asn 


Phe 


He 


Cys 


Glu 


Lys 


Asp 


Arg 




97 




320 










325 










330 












98 


gag 


aca 


g ta 


ctg 


tea 


tet 


gca 


tta 


taacggactg tgatgggatc acatgagcaa 


1725 


99 


Glu 


Thr 


Val 


Leu 


Ser 


Ser 


Ala 


Leu 





















100 335 340 

101 attttcaget etcaaaggca aaggacactc ctttctaatt gcatcacctt ctcatcagat 1785 

102 tgaaaaaaaa aaaageactg aaaaccaatt actgaaaaaa aattgacagc tagtgttttt 1845 

103 taceatccgt cattacceaa agacttggga actaaaatgt tccccagggt gatatgctga 1905 
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RAW SEQUENCE LISTING 

PATENT APPLTCATTON: US/09/763,712 



DATii: 03/0 7/2001 
TIME: 13:13:29 



Input: Set : A:\3 7157SEQ.txt 

Output Set: N:\CRF3\03072001\l763712.raw 



104 ttttcattgt gcacatggac tgaatcacat agattctcct ccgtcagtaa ccgtgcgatt 196 5 

105 otacaaaLta tgtcttccaa agtatggaac acLccaatca gaaaaaggtt atcatcccg 2024 
10 7 <210> SfJQ ID NO: 2 

10 8 <211> LhJNGTH; 547 

109 <212> TYPE: PRT 

110 <213> ORGANISM; Homo Sapiens 

112 <220> FEATURE: 

113 <223> OTHER INEORMATION : Deduced Amino Acid Sequence of: Novel Collectin from Nucleotide 

114 Sequence 

116 <4 00> SEQUENCE: 2 

117 Met Tyr Ser His Asn Val Val lie Met Asn Leu Asn Asn Leu Asn Leu 
118. 1 5 10 15 

119 Thr Gin Val Gin Gin Arg Asn Leu lie Thr Asn Leu Gin Arg Ser Val 

120 20 25 30 

121 Asp Asp Thr Ser Gin Ala lie Gin Arg lie Lys Asn Asp Phe Gin Asn 

122 35 40 ■ 45 

12 3 Leu Gin Gin val Phe Leu Gin Ala Lys Lys Asp Thr Asp Trp Leu Lys 
124 50 55 60 

12 5 Giu Lys Val Gin Ser Leu Gin Thr Leu Ala Ala Asn Asn Ser Ala Leu 
126 65 70 75 80 

12 7 Ala Lys Ala Asn Asn Asp Thr Leu Giu Asp Met Asn Ser Gin Leu Asn 
128 85 90 95 

12 9 Ser Phe Thr Gly Gin Met Giu Asn lie Thr Thr lie Ser Gin Ala Asn 

130 100 105 110 

131 Giu Gin Asn Leu Lys Asp Leu Gin Asp Leu His Lys Asp Ala Giu Asn 

132 115 120 125 

13 3 Arg Thr Ala lie Lys Phe Asn Gin Leu Giu Giu Arg Phe Gin Leu Phe 
134 130 135 140 

13 5 Giu Thr Asp lie Val Asn lie lie Ser Asn He Ser Tyr Thr Ala His 
136 145 150 155 160 

13 7 His Leu Arg Thr Leu Thr Ser Asn Leu Asn Giu Val Arg Thr Thr Cys 

138 165 170 175 

139 Thr Asp Thr Leu Thr Lys His Ttir Asp Asp Leu Thr Ser Leu Asn Asn 

140 180 185 190 

141 Thr Leu Ala Asn lie Arg Leu Asp Ser Val Ser Leu Arg Met Gin Gin 

142 195 200 205 

14 3 Asp Leu Met Arg Ser Arg Leu Asp Thr Giu Val Ala Asn Leu Ser Val 
144 210 215 220 

14 5 He Met Giu Giu Met Lys Leu Vai Asp Ser Lys His Gly Gin Leu Tie 
146 225 230 235 240 

14 7 Lys Asn Phe Thr He Leu Gin Gly Pro Pro Gly Pro Arg Gly Pro Arg 
148 245 250 255 

14 9 Gly Asp Arg Gly Ser Gin Gly Pro Pro Gly Pro Thr Gly Asn Lys Gly 

150 260 265 270 

151 Gin Lys Gly Giu Lys Gly Giu Pro Gly -Pro Pro Gly Pro Ala Gly Giu 

152 275 280 285 

15 3 Arg Gly Pro He Gly Pro Ala Gly Pro Pro Gly Giu Arg Gly Gly Lys 
154 290 295 300 

15 5 Giy Ser Lys Gly Ser Gin Gly Pro Lys Gly Ser Arg Gly Ser Pro Gly 
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RAW SEQUENCE LISTING DATE: 03/07/2001 

PATl=JNT APPLICATION: US/09/763,712 TIME: 13:13:29 

Input Set : A:\37157SEQ.txt 

Output Set: N:\CRP3\0307200i\I763712.raw 



156 


305 






310 










315 










320 


157 


Lys Pro Gly 


Pro 


Gin 


Gly 


Pro 


Ser 


Gly 


Asp 


Pro 


Cly 


Pro 


Pro 


Gly 


Pro 


i5y 






325 










330 










3 3 5 




159 


Pro Gly Lys 


Glu 


Gly 


Leu 


Pro 


Gly 


Pro 


G ]. 11 


Gly 


Pro 


Pro 


Gly 


Phe 


Gin 


160 




34 0 










34 5 










350 






161 


Gly Leu Gin 


Gly 


Thr 


Val 


Gly 


Glu 


Pro 


Gly 


Va 1 


Pro 


Gly 


Pro 


Arg 


Gl y 


162 


355 










360 










36 5 








16 3 


Leu Pro Gly 


Leu 


Pro -Gly 


Va 1 


Pro 


Gly 


Met 


Pro 


Gly 


Pro 


Lys 


Gly 


Pro 


164 


37 0 








37 5 










38 0 










165 


Pro Gly Pro 


Pro 


Gly 


Pro 


Ser 


Gly 


Ala 


Val 


Val 


Pro 


Leu 


Ala 


Leu 


Gin 


166 


385 






390 










395 










4 00 


167 


Asn Glu Pro 


Th r 


Pro 


Ala 


Pro 


Glu 


Asp 


Asn 


Gly 


Cys 


Pro 


Pro 


His 


Trp 


168 






4 05 










410 










415 




169 


Lys Asn Phci 


Thr 


Asp 


Lys 


Cys 


Tyr 


Tyr 


Phe 


Ser 


Val 


Glu 


Lys 


Glu 


He 


170 




4 20 










42 5 










4 30 






171 


Phe Glu A'sp 


AlfJ 


Lys 


Leu 


Phe 


Cys 


Gl u 


Asp 


Lys 


Ser 


Ser 


His 


Leu 


Val 


17 2 


435 










4 40 










44 5 








173 


Phe He Asn 


Thr 


Arg 


Glu 


Glu 


Gin 


Gin 


Trp 


He 


Lys 


Lys 


Gin 


Met 


Val 


174 


450 








4 5 5 










460 










175 


Gly Arq Glu 


Ser 


His 


Trp 


He 


Gly 


Leu 


Thr 


Asp 


Ser 


Glu 


Arg 


Glu 


Asn 


176 


465 






470 










475 










480 


177 


Glu Trp Lys 


Trp 


Leu 


Asp 


Gly 


Thr 


Ser 


Pro 


Asp 


Xyr 


Lys 


Asn 


Trp 


Lys 


17 8 






485 










490 










495 




179 


Ala Gly Gin 


Pro 


Asp 


Asn 


Trp 


Gly 


His 


Gly 


His 


Gly 


Pro 


Gly 


Glu 


Asp 


180 




500 










505 










510 






181 


Cys Ala Gly 


Leu 


He 


Tyi: 


Ala 


Gly 


Gin 


Trp 


Asn 


Asp 


Phe 


Gin 


Cys 


Glu 


182 


515 










520 










525 








18 3 


Asp Val Asn 


Asn 


Phe 


He 


Cys 


Glu 


Lys 


Asp 


Arg 


Glu 


Thr 


Val 


Leu 


Ser 


184 


530 








5 35 










54 0 










185 


Ser Ala Leu 




























18 6 


545 




























188 


<210> SEQ ID NO; 


3 
























189 


<211> LENGTH: 27 
























190 


<212> TYPE: 


PRT 


























191 


<213> ORGANISM: 


Artiticial Sequence 
















1 9 3 


<22 0> FEATURE: 


























194 


<22 3> OTHER 


INFORMATION: 


: Modified Consensus 


Sequence of 


collecti 


195 


Coliectin 


























197 


<400> SEQUENCE: 


3 
























198 


Glu Lys Cys 


val 


Glu 


Met 


Tyr 


Thr 


Asp 


Gly 


Lys 


Trp 


Asn 


Asp 


Arg 


Asn 


199 


1 




5 










10 










15 




200 


Cys Leu Gin 


Ser 


Arcj 


Leu 


Ala 


He 


Cys 


Glu 


Phe 












2U1 




20 










25 
















20 3 


<210> SEQ IC 


) NO: 


4 
























204 


<211> LENGTH 


I: 21 
























205 


<212> TYPE: 


DNA 


























206 


<213> ORGANISM: 


Artificial Sequence 
















208 


<220> FEATURE: 


























209 


<223> OTHER 


INFORMATION: 


: Sequence of a Reverse Primer for Screen 
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RAW SEQUENCE LISTING 

PATKWT APPi/ICATION: US/09/7 6 3, 712 



DATE: 03/07/2001 
TIMHl: 13:13:29 



Input: Set : A:\37157SEQ.txt 

OiiLpuL Set: N:\CRF3\03072OOi\l763712 .raw 



211 
212 



<4 00> SEQUENCE: 4 
caatctgciLg agnaggtgat g 



21 



2:14 <210> SPJQ ID NO: 5 

215 <211> LENGTH; 21 

216 <212> TYPE: DNA 

217 <213> ORGANISM: Artificial Sequence 

219 <220> FEATURE: 

220 <223> OTHER INFORMATION: Sequence ol a Forward Primer for Screening a Novel Collectin. 
22 2 <4 00> SEQUENCE: 5 

22 3 Qcgaggggct ggatgggaca t 21 
225 <210> SEQ ID NO: 6 

2 26 <211> LENGTH: 2 7 
2 27 <2:i2> TYPE: PR'J' 

228 <213> ORGANISM: Artificial Sequence 

23 0 <220> FEATURE: 

231 <223> OTHER INFORMATION: Consensus sequence of three collectins v/hich were reported 
23 2 heretofore 
234 <400> SEQUENCE: 6 

23 5 Glu Asp Cys Val Leu Leu Leu Lys Asn Gly Gin Trp Asn Asp Val Pro 
23(5 1^5 10 15 

23 7 Cys Ser Thr Ser His Leu Ala Val Cys Glu Plie 
238 20 25 

240 <210> SEQ ID NO: 7 

241 <211> LENGTH: 24 
'24 2 <212> TYPE: DNA 

24 3 <213> ORGANISM: Artificial Sequence 
24 5 <2 20> FEATURE: 

24b <223> OTHER INFORMATION: Ml 3 Universal Primer Sequence for Sequencing 

24 8 <400> SEQUENCE: 7 

249 cgacgttgta aaacgacggc cagt 24 
2 51 <210> SEQ ID NO: 8 

252 <211> LENGTH: 17 

253 <212> TYPE: DNA 

2 54 <213> ORGANISM: Artificial Sequence 

25 6 <2 20> FEATURE: 

257 <223> OTHER INFORMATION: Ml 3 Reverse Primer Sequence for. Sequencing, 

259 <4 00> SEQUENCE: 8 

260 caggaaaca gctatigac 17 
262 <210> SEQ ID NO: 9 

26 3 <211> LENGTH: 24 
2 64 <212> TYPE : DNA 

26 5 <213> ORGANISM: Artificial Sequence 
267 <2 2 0> FEATURE: 

26 8 <2 2 3> OTHER INFORMATION: Sequence of a lambda gtll Reverse Primer for Sequencing. 
2 70 <4 0 0> SEQUENCPJ: 9 

271 ttgacaccag accaactggt aatg 24 

27 3 <210> SEQ ID NO: 10 

274 <211> LENGTH: 24 

275 <212> TYPE: DNA 
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VERIFICATION SUMMARY DATE: 03/07/2001 

PATENT APPLICATION: US/09/763,712 TIME: 13:13:30 

Input Set : A:\37157SEQ.txt 

Output Set: N:\CRF3\03072001\I763712.raw 

L:9 M:270 C: Current Application Number differs, Replaced Current Application No 
L:9 M:271 C: Current Filing Date differs > Replaced Current Filing Date 
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